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Abstract 

In this paper, some issues concerning the Chinese remaindering 
representation are discussed. Some new converting methods, including 
an efficient probabiUstic algorithm based on a recent result of von zur 
Gathen and Shparlinski are described. An efficient refinement of 
the NC^ division algorithm of Chiu, Davida and Litow [2j is given, 
where the number of moduli is reduced by a factor of logn. 
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1 Introduction 

For the fundamental arithmetic operations, it is often desirable to represent 
an integer as a vector of smaller integers. This can be done by selecting a set 
of pairwise coprime positive integers mi,m2, • • • , rrir, and mapping an inte- 
ger X to the vector of residues |a^|m2) ' ' ' > l^lm,.)) where \x\m^ denotes 
X (mod rrii). This approach is called the Chinese remaindering represen- 
tation (CRR), as the Chinese remainder theorem (CRT) guarantees such 
mapping is meaningful. Using CRR, large calculations can be split as a 
series of smaller calculations that can be performed independently and in 
parallel. So, this approach has a significant role to play in applications such 
as cryptography and high precision scientific computation. 
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It is well known that three basic arithmetic operations, addition, sub- 
traction, and multiplication, can be performed in O(logn) time using n'^^^^ 
processors. These operations can also be done in the manner of log-space 
uniform. However, the parallel complexity of integer division is a subtle 
problem and has attracted a lot of attention. The first 0(log?7-) time n^^^^ 
sized circuit for integer division was exhibited by Beame, Cook and Hoover 
[1]. Recently, the log-depth, polynomial size, logspace-uniform circuit fam- 
ily for integer division (i.e., integer division is in logspace-uniform NC^) was 
described by Chiu, Davida and Litow [2]. This settled a longstanding open 
problem and provided an optimal computation efficiency theoretically. 

In this paper, we discuss some issues concerning the Chinese remainder- 
ing representation. The organization of the paper is as follows. Section 2 
describes the Chinese remaindering system. Two methods for converting a 
vector to the corresponding integer are presented in this section. Section 3 
focuses on the integer division using CRR. Under the framework of NC^, an 
efficient refinement of the division algorithm of Chiu, Davida and Litow [2] 
is proposed. 

2 Chinese Remainder Representation 

Let A4 = {nil, 1712, ■ ■ ■ , nT'r} be a set of pairwise coprime integers and M = 

r 

JJmj. For a set of integers xi,X2, ■ ■ ■ ,Xr with < Xj < rrii, the Chinese 

i=l 

Remainder Theorem says that the system of congruence 

X = xi (mod mi) 
X = X2 (mod 7712) 

X = Xr (mod mr) 

has a unique solution < x < M. In fact, using the extended Euclidean 
algorithm, one finds integers ui , U2 , • • • , Ur such that 

Z^Ui — = 1, 
^-^ mi 

1=1 
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and it is easy to verify that 

x = ^S^XiUi — (mod M) (1) 

^ rrii 
1=1 

gives the desired solution. It is remarked that one can also choose Ui = 
M 

( — ) (mod rrii); and such choice of Ui will be used in the rest of our 
discussion. 

The above system is called a Chinese remaindering representation (CRR) 
based on the set M., and is denoted by CRR(A4). 

Now we present a method of finding Ui's which can be seen as an alter- 
native to the Garner algorithm described in [7] (pages 290,293). 

For each j > 1, rrij is coprime to mi ■ ■ ■ rrij^i. Therefore, by the extended 
Euclidean algorithm, there exist integers Cij,f3j such that 

ajirij + f3jmi ■ ■ ■ rrij^i = 1. (2) 

With these r — 1 pairs of (aj,/3j), the coefficients Ui can be computed as 
follows: 

Ul <— 0203- ■ -Or (mod mi) 
U2 ^ P2a3 ■ ■ - ar (mod 771-2) 
■"3 *— Pa^A ■ ■ ■ Or (mod ms) 

Ur <— Pr (mod rrir) 

The correctness of the above algorithm is based on the following identity: 

(q2 • • • 0^)77127713 • • • rrir + (/32Ci3 ■ • • ar)mim3 • ■ ■ rrij. + 
{(izot/i ■ ■ ■ ar)mim2m4 ■ ■ ■ rrir + • • • + f3rmim2 ■ • • rrir-i = 1. 

This identity can be verified using the standard mathematical induction: 
for 7 > 2, suppose that 

(02 • • • aj_l)777277l3 • ■ ■ 777j_i + (/?2a3 ' " " aj-l)777i7773 • ■ ■ 77lj_i + 
(13304 ■ ■ ■ Oj- 1)7711 7772 7774 " ' ' "Ij-l H h /3i_l777 i7772 ' • • 777j_2 = 1. 
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Multiply both sides of the above by aiirii, and apply the equation ([T]) for 
j = i, one gets 

(a2 • ■ ■ ai)m2m3 • • • + (/32a3 • ■ ■ Oii)mim^ • • • mj + 
{(i^Ui ■ ■ ■ Qj)mim2m4 • • • mj H h Pimim2 ■ ■ ■ rrii-i = 1. 

It is remarked that in this process, we call the extended Euclidean algo- 

r(r — 1) 

rithm r — 1 times. For the method described in [7j, instances of 

extended Euclidean algorithm need to be invoked, for pairs (mj,mj) with 
i < j. 

Next we present a probabilistic converting method for CRT. For positive 
integers Ni,N2, let ai,a2,--- ,ar be in {1,2,- ■■ ,A^i}. Pick 2r uniformly 
distributed random integers si, S2, • ■ ■ ,Sr and ti,t2, ■ ■ ■ , tr in {1, 2, • • • , A^2} 
and consider the linear forms 

r r 

5 = ^ aiSi, r = ^ Ojij. 

i=l i=l 

It has been proved by Cooperman, Feisel, von zur Gathen and Havasin in 
[3] that with high probability 

gcd(ai,a2,--- ,0,.) = gcd(5',r). (3) 

This was improved recently by von zur Gathen and Shparlinski [5j and they 
gave the following strong result: with probability at least —^r + o(l), 

gcd(ai,a2,--- ,0^) = gcd(5,T), 

iV2 

provided that is large enough. 

r + m A''i 

This result can be used to produce a very efficient probabilistic algorithm 
for Chinese remaindering. Let us take = — . We can find x such that 

X = xi (mod mi) 
X = X2 (mod 777-2) 



X = Xr (mod rrir 



by the following steps: 
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1. Choose random linear forms S,T until 

gcd(5,r) = i. 

(The expected number for getting the desired pair of S, T is less than 
2.) 

2. Use extended Euclidean algorithm to get integers u, v such that 

Af 

uS + vT = > [usi + vti) — = 1. 



mi 



3. The solution x is 

x = > xAusi + vti) — (mod M). 
1=1 

Remark. It can be seen that in this routine, if the extended Euclidean 
algorithm is used to compute all gcds, then the expected number of rounds 
to get u, V in step 2 is less than 2. In step 3, usi + vti can be replaced by 
{uSi + vti) (mod rrii). 

3 An Improved NC^ Division Algorithm 

In this section, we discuss the division algorithm of Chiu, Davida and Litow 
[2] . A careful analysis enables us to reduce the number of prime moduli by 
a factor of log n. 

Let a be a real number. A rational number a' is said to be an n—hit 
under approximation to a if 

< a - a' < — . 

The next result improves the lemma 3.2 of [2]: 

Lemma 1 Let i < a < 1 and 8 = 1 — a. If 4—, -r-, ■ ■ ■ , o-re in + 

3) — bit underapproximations to (3, then 

n+l 



^ Ai ^ ^ y Ai 

1 



is an n—hit underapproximation to 

a 



5 



Proof. Let 

ri = min | — |. 

l<i<n+l Ai 

Note that < 3 < - and < d — ri < — -it , we see that 
~ 2 ~ 2""^""^ 



a A\ A1A2 Ai a 

i=l 



1 1-r] 



n+2 



1-/3 1-7] 
i-^ ^)+^ 



^n+2 



(1-/3)(1-??) 1-ry 



1 1 

_ 11 + 



11 1 
2 ■ 2 2 

1 



In [2], the log-depth, polynomial size, logspace-uniform circuit family 
for integer division was constructed by Chiu, Davida and Litow. In other 
words, integer division is proved to be in logspace-uniform NC^. This solves 
a longstanding open problem. 

Notice that the original construction of the NC^ circuit family for integer 
division needs 3n^ (actually 2n^ -|- 5n) primes numbers. The main purpose 
of this section is to refine the Chiu-Davida-Litow construction to achieve 

more efficiency. To be more specific, we shall show that h 3n primes 

logn 

will be sufficient. 

Theorem 1 The number of prime moduli of the Chiu-Davida-Litow NC^ 

integer division algorithm can be reduced to h 3n. 

logn 

Proof. The proof follows the similar line as in |2]. 

X 

The goal is: given x, y < 2", compute the CRR of — 

ly. 
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Let N = + 3n. 

logn 

Suppose that x, y are represented in a CRR system with base {mi, m2, . . . , m„} 
where rui is the (i + 2)th prime (mi > 3). This base is extended to 

{mi,m2, . . . ,mn,m„+i, . . .,171^}- 

A product D of the initial part of the base and some power of 2 will be 
constructed so that 

1 y 
-< — <!. 

2 - D 

According to [2j, if y = 2, set D = 2. If y > 2, then take j < n to be the 
number such that 

17111712 ■ ■ ■ rrij < y < mim2 • • • rajTiij-^-i. 

Let k be the smallest positive integer such that y < 2^mim2 • • • mj (therefore 
11 1 

> — ), and set 



2^mim2 - ■ ■ rrij 2' 



D = 2^mim2 ■ ■ ■ mj. 



Let r = I I . If n > 26, then ^ l°gj^ Oog^)' > 3, ^he fact that 
'-logn-' logn 



m„+i > 2n gives 



I " 

{TUn+iY > (2n)Liogr 

„ I n-logn-(logn)-^ 

> 2^^+^ (4) 
Since n + (n + l)r < N, we can form the following products: 
Ai = m„+im„+2 • • • run+r 



A, 
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We note that Ai > 2"+^ for i = 1, 2, . . . , n + 1, by ([3]). 
Next, choose 

I {D - y)Ai I 
ti = [ — J , for z = 1, 2, . . . , n + 1. 

Similar to [2j, can be computed in NC"*^. It is also routine to check that 

is an (n + 3)— bit underapproximation to /3 = — pr~- 
Ai D ^ 

Finally, by the lemma [U we get an n— bit underapproximation to — 

a 

where a = —: 

7 = 1 + -;- + ~A — IT "I ^ 



Ai A1A2 A1A2 ■ ■ ■ An+l 

Again, similar to [2], we have 

And all the computations are done in NC^. 

Remark. The Chebyshev bounds for primes can be used to get an inequality 
which is a bit sharper than the inequality ([5]), but there is no significant 
reduction on the number of prime moduli. 
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